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Abstract 

Motivated by call center practice, we propose a tractable model for GI/GI/n + GI queues in 
the efficiency-driven (ED) regime. We use a one-dimensional diffusion process to approximate 
the virtual waiting time process that is scaled in both space and time, with the number of 
servers and the mean patience time as the respective scaling factors. Using this diffusion model, 
we obtain the steady-state distributions of virtual waiting time and queue length, which in 
turn yield simple formulas for performance measures such as the service level and the effective 
abandonment fraction. These formulas are generally accurate when the mean patience time is 
several times longer than the mean service time and the patience time distribution does not 
change rapidly around the mean virtual waiting time. For practical purposes, these formulas 
outperform existing results that rely on the exponential service time assumption. 

To justify the diffusion model, we formulate an asymptotic framework by considering a se¬ 
quence of queues, in which both the number of servers and the mean patience time go to infinity. 
We prove that the space-time scaled virtual waiting time process converges in distribution to the 
one-dimensional diffusion process. A fundamental result for proving the diffusion limit is a func¬ 
tional central limit theorem (FCLT) for the superposition of renewal processes. We prove that 
the superposition of many independent, identically distributed stationary renewal processes, 
after being centered and scaled in space and time, converges in distribution to a Brownian mo¬ 
tion. As a useful technical tool, this theorem characterizes the service completion process in 
heavy traffic, allowing us to greatly simplify the many-server analysis when service times follow 
a general distribution. 


1 Introduction 


Queues with many parallel servers are building blocks for modeling call center operations; see Gans 


et al. (2003) and Aksin et al. (2007) for comprehensive reviews. A large call center faces a great 


amount of traffic that is stochastic and time-varying. Since the rate of incoming calls changes over 
time, the system may become overloaded during peak hours. Waiting on a phone line, a customer 
may hang up before being connected to an agent. This phenomenon, referred to as customer 
abandonment, is present in almost all call centers and becomes prominent when the system is 
overloaded. A call center may also be intentionally operated in an overloaded regime. Nowadays, 
more and more firms outsource their call centers to save costs. In service-oriented call centers. 
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staffing costs usually dominate the expenses of customer delay and abandonment. As pointed out 
by Whitt (2006), the rational operational regime for these call centers is the efficiency-driven (ED) 
regime that emphasizes capacity utilization over the quality of service. In the ED regime, the 
service capacity is set below the customer arrival rate by a moderate fraction. Because the lost 
demands of abandoning customers compensate for the excess of customer arrivals over the service 
capacity, a call center operated in the ED regime can still achieve reasonable service quality. More 
specihcally, the mean customer waiting time is comparable to the mean service time, a moderate 
fraction of customers abandon the system, and all agents are almost always busy. 

Service requirements for an outsourced call center are specified in the service-level agreement 
(SLA) between the firm and the call center provider. The SLA includes performance objectives 
such as the average service time, the acceptable abandonment rate, and the acceptable customer 
delay time. One important service level objective is a specified percentage of customers to be 
served within a given delay, e.g., “80% of calls should be answered within one minute.” Based on 


this service level, Baron and Milner 

(2009 

) studied SLA design using the M/M/n -|- M model, and 

Mandelbaum and Zeltyn ( 

2009 

) studied call center staffing using the M/M/n -|- GI model. Despite 


service time distribution is not a realistic assumption for call center customers. Therefore, these 
models may not be able to provide adequate estimates for the performance measures required by 
the SLA. Call center managers may need a more accurate but still tractable model for performance 
analysis and staff deployment. 

For queues in the ED regime, a fluid model proposed by Whitt (2006) is useful for estimating 
several performance measures, including the fraction of abandoning customers, the mean queue 
length, and the mean virtual waiting time. In the M/M/n -|- GI setting, the accuracy of the fluid 
model was studied by Bassamboo and Randhawa (2010). They proved that in the steady state, 
the accuracy gaps of fluid approximations for the mean queue length and the rate of customer 
abandonment do not increase with the arrival rate. As a deterministic model, however, the fluid 
model cannot be used to predict the percentage of customers to be served within a time limit. A 
refined model is thus necessary for estimating such a measure. 

The focus of this paper is a diffusion model for many-server queues in the ED regime. Both 
the service and patience time distributions are assumed to be general. Using this diffusion model, 
we obtain the steady-state distributions of virtual waiting time and queue length, which in turn 
yield approximate formulas for performance measures such as the service level mentioned above. 
These formulas are able to produce accurate estimates, especially when customer patience times are 
relatively long compared with their service times. Empirical studies suggest that this requirement is 
realistic for service-oriented call centers. For example, it was reported by Mandelbaum et al. (2001) 
and Mandelbaum and Zeltyn (2013) that in the call center of an Israeli bank, the mean customer 
patience time was several times longer than the mean service time. By numerical experiments, we 
demonstrate that for practical purposes, this diffusion model outperforms existing models that rely 


on the exponential service time assumption; see Section 4.2 


The general service time assumption is a major challenge in the analysis of many-server queues. 
In the literature, the studies of many-server queues with a general service time distribution usually 
involve the analysis of one or several infinite-dimensional processes, which are used for tracking 
customer age or residual times. The resulting approximate models are also infinite-dimensional, 
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typically in the form of two-parameter or measure-valued processes; see, e.g.. 

Whittl ( 

2006) 

Kang 

and Ramanan 

(2010 , 

Kaspi and Ramanan 

(2011) 

Zhang 

(2013), and Kaspi and Ramanan i 

2013). 


Because these approximate models are either deterministic or too complex to be used for estimating 
a distribution, explicit formulas have been absent from the literature for the steady-state virtual 
waiting time and queue length distributions. Estimation of these distributions relies heavily on 
simulation; see, e.g., Blanchet and Lam (2014). The performance formulas provided in this paper 
can fill this gap for queues in the ED regime. 

We use a one-dimensional Ornstein-Uhlenbeck (OU) process to approximate the virtual waiting 
time process. We obtain this diffusion model by scaling a many-server queue in both space and time, 
and then replacing the centered and scaled arrival, service completion, and abandonment processes 
with mutually independent Brownian motions. Depending on the service time distribution through 
the first two moments, this model allows us to obtain Gaussian approximations for the steady- 
state virtual waiting time and queue length distributions. In contrast, the approximate models in 
the literature are derived by scaling many-server queues in space only. As a result, the general 
service time distribution needs to be incorporated in the approximate model, leading to an infinite¬ 
dimensional Markovian representation. It is well known that when a many-server queue is critically 
loaded, the system performance depends on the entire service time distribution and differs from 
that of a queue with one or several servers significantly; see, e.g. 


Dai et al. ( 

2010 

), 

Mandelbaum 


and Momcilovic (2012), and Kaspi and Ramanan (2013). Erom a macroscopic perspective in both 


space and time, we demonstrate that in the ED regime, the dynamics of a many-server queue could 
be as simple as that of a single-server queue, even though the service time distribution is assumed 
to be general. A one-dimensional diffusion process, depending on the service time distribution only 
by its hrst two moments, may suffice to capture the dynamics of the many-server system. (One may 
also refer to He (2013), an earlier version of this paper, where we proved a common diffusion limit 
for systems with an exponential patience time distribution, with either a single or many servers in 
the overloaded regime, using a different method than the one in this paper.) Such a simple model 
is much more attractive for analysis and control purposes. 

The diffusion model and the performance formulas are rooted in the limit theorems presented 
in Section]^ For queues in the ED regime, Dai et al. (2010) proved a multi-dimensional diffusion 
limit for the Gl/Ph/n -|- M model, and Huang et al. (2014) proved a one-dimensional diffusion 
limit for the GI/M/n -|- GI model. In addition, Huang et al. (2014) applied the obtained diffusion 
limit to delay announcement in call centers. In this paper, we consider a sequence of GI/GI/n-|-GI 
queues indexed by the number of servers n, and assume that the mean patience time goes to infinity 
as n goes large. In this asymptotic framework, the virtual waiting time process of each queue is 
scaled in both space and time, with the number of servers and the mean patience time being the 
respective scaling factors. The joint scaling scheme is essential to obtain a Brownian approximation 
for the service completion process, eventually leading to a one-dimensional diffusion limit. In the 


previous studies such as Dai et al. (2010), Mandelbaum and Momcilovic (2012), and Kaspi and 


Ramanan (2013), the queueing processes are scaled by the number of servers only. In this case, 


only when the service time distribution is exponential, will the scaled queueing process converges 
to a one-dimensional diffusion process. To the best of our knowledge, Theorem in this paper 
is the first rigorous result that identifies a one-dimensional diffusion limit for many-server queues 
with a general service time assumption. 
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The technique of joint seating in space and time was adopted by Whitt (2003, 2004), Gurvich 


(2004), and Atar (2012) for many-server queues with an exponential service time distribution. Whitt 


(2004) considered the M/M/n/r + M modei and proved that in the ED regime, the queue iength 
process has a diffusion iimit when the product of the number of servers and the mean patience 
time goes to infinity. A criticaiiy ioaded regime, known as the nondegenerate siowdown regime, 
was studied by Whitt (2003), Gurvich (2004), and Atar (2012). In this regime, the diffusion limit 
for the queue length process is proved to be either a reflected OU process when the patience time 
distribution is exponential, or a reflected Brownian motion when there is no customer abandonment. 
Because of the exponential service time assumption, it is certain that those diffusion limits are one¬ 
dimensional. The ability of space-time scaling to simplify the analysis of many-server queues is 
barely manifested by those papers. In this sense, it is the general service time assumption that 
distinguishes our work from others. By means of space-time scaling, we provide a paradigm for 
building a tractable model for many-server queues with a general service time distribution. 

Proving the diffusion limit requires new fundamental tools. Theorem in this paper is a 
functional central limit theorem (FCLT) for the superposition of renewal processes. We prove that 
the superposition of n independent, identically distributed (iid) stationary renewal processes, after 
being centered and scaled in space and time, converges in distribution to a Brownian motion as 
n goes large. For a many-server system in heavy traffic, the space-time scaled service completion 
process is characterized by this theorem, which allows us to bypass the analysis of the queue’s 
infinite-dimensional state process. To apply this theorem, we consider a sequence of perturbed 
systems that are asymptotically equivalent to the original queues. We assume that servers in a 
perturbed system are always busy so that the service completion process is the superposition of 
n renewal processes. The simplified dynamics of the perturbed system enable us to follow the 
procedure in Huang et al. (2014), proving the diffusion limit by a continuous mapping approach. 

We would summarize the contributions of this paper as follows. First, the explicit formulas 
obtained from the diffusion model are practical tools for performance estimation and staff deploy¬ 
ment in efficiency-driven service systems. In spite of their simple expressions, these formulas are 
superior to the widely used formulas relying on the exponential service time assumption. Second, 
we establish limit theorems to justify the diffusion model. By means of space-time scaling, we prove 
a one-dimensional diffusion limit for many-server queues with a general service time distribution. 
This joint scaling approach is useful for building tractable models for queues in the ED regime. 
Third, we prove an FGLT for the superposition of renewal processes. It is a fundamental result that 
characterizes the service completion process of a many-server queue in heavy traffic. This theorem 
enables us to bypass the infinite-dimensional analysis in proving the diffusion limit. 

The remainder of the paper is organized as follows. The diffusion model and the performance 
formulas are introduced in Section The limit theorems for the diffusion model are presented 
in Section We examine and discuss the approximate formulas by numerical experiments in 
Section Section is dedicated to the proof of the diffusion limit for the virtual waiting time 
process, and Section presents the proof for the queue length limit. The paper is concluded in 
Section We leave the proof of the FCLT and the proofs of technical lemmas to the appendix. 

Let us close this section with frequently used notation. The space of functions / : M+ —)• that 
are right-continuous on [0, oo) and have left limits on (0, oo) is denoted by (with D = D^), which 
is endowed with the Skorokhod Ji topology (see, e.g., Billingsley (1999)). For / G D, we use f{t—) 
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to denote the left limit of / at t and A/(t) the increment at t, i.e., A/(t) = — For T > 0, 

we use jQ^|d/(t)| to denote the total variation of / over [0,T]. For /' G D that is nondecreasing 
and takes values in M+, f o f is the composed function in D, i.e., (/ o f'){t) = We use e 

for the identity function on M_|_ and x the constant one function on M+, i.e., e{t) = t and xi^) = 1 
for t > 0 . 


2 Diffusion model and performance formulas 


Consider a GI/GI/n + GI queue, whose customer arrival process is a renewal process and service 
times are iid nonnegative random variables. Customers are served by n identical servers. Upon 
arrival, a customer gets into service if an idle server is available; otherwise, he waits in a buffer with 
inhnite room. Waiting customers are served on the first-come, hrst-served basis, and the servers 
are not allowed to idle if there are customers waiting. Each customer has a random patience time. 
When a customer’s waiting time exceeds his patience time, the customer abandons the system 
without being served. The patience times are iid nonnegative random variables, and the sequences 
of interarrival, service, and patience times are mutually independent. 

Let A be the customer arrival rate and // the service rate of each server. The traffic intensity 
satishes p = Xj{np) > 1 . When a many-server queue becomes overloaded, all servers will be almost 
always busy, so the fraction of abandoning customers will be around 


a = 


p-l 


P 


( 2 . 1 ) 


Assume that both interarrival times and service times have finite variances, with squared coefficients 
of variation and c^, respectively. Let 0 be the distribution function of patience times and 7 the 
mean patience time. Assume that 0 is absolutely continuous with a bounded, strictly positive 
density function /©. 

Suppose that at time t > 0, a hypothetical customer with infinite patience arrives at the queue. 
Let W{t) be the virtual waiting time at t, i.e., the amount of time this hypothetical customer has 
to wait before getting into service. As the queue comes into the steady state, the virtual waiting 
time process fluctuates around a mean level w that can be determined as follows: Because Q{w) is 
the fraction of customers whose patience times are less than w, it should be approximately equal 
to the fraction of abandoning customers. Then, Q{w) = {p — l)/p, which yields 


= ( 2 . 2 ) 

To represent the fluctuation of the virtual waiting time process around this equilibrium level, we 
introduce a centered and scaled version of W by 


W{t) = ^{W{xt)-w). 

We refer to W as the diffusion-scaled virtual waiting time process. To obtain this process, we scale 
the virtual waiting time process in both space and time after removing the mean w. Besides the 
commonly used scaling in space by the number of servers, we also change the time scale of the 
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process by using the mean patience time as the scaling factor. 

We approximate the diffusion-scaled virtual waiting time process by an OU process, which is 
given by the following stochastic differential equation 

W{t) = W{0) + M{t) — p'jfQ{w) f W{u)du for t > 0. 

Jo 

Here, M is a driftless Brownian motion with M(0) = 0 and variance 


-2 _ci + pci + p-l 

pp 


The OU process is a reasonable approximate model because the virtual waiting time process is 
mean-reverting: As the virtual waiting time fluctuates around tc, the instantaneous abandonment 
rate from the buffer fluctuates accordingly. If the probability density function of patience times does 
not change much around w, the relative variation in the abandonment rate will be approximately 
proportional to the relative variation in the virtual waiting time. When the virtual waiting time is 
either too long or too short, the increased or decreased abandonment rate will pull it back toward 
the mean level w. 

For the diffusion approximation to be accurate, the mean patience time 7 , serving as the scaling 
factor in time, should be relatively long compared with the mean service time. This requirement 
can be justified by Theorem where the ED regime is formulated into an asymptotic framework 
and W is proved to be the limit of a sequence of diffusion-scaled virtual waiting time processes. 
Although the mean patience time goes to infinity in this asymptotic framework, the diffusion model 
may still produce accurate performance estimates when it is just several times longer than the mean 
service time. We will discuss the influence of the mean patience time in Section |4.1[ 


The OU process is strongly ergodic with a Gaussian steady-state distribution (see, e.g., Karlin 
and Taylor (|l981 )). More specifically, the steady-state distribution of IT has mean zero and variance 


.2 _ Ca + PG + P- 1 

O',,, — 




Let W ( 00 ) be the virtual waiting time in the steady state and W ( 00 ) be the diffusion-scaled version. 
Because W is an approximation of W, their steady-state distributions are expected to be close, i.e.. 


P[IU(cx)) > a] 


1 — <I> ( — I for a G 

^ (T7/ 


(2.3) 


where is the standard Gaussian distribution function. Consequently, the steady-state virtual 
waiting time approximately follows a Gaussian distribution with mean w and variance 


^2 _ 


Cg + PCs + P - 1 
2np‘^pfe{w) 


(2.4) 


The percentage of customers to be served within a specified delay is referred to as the service 
level in practice. Let C be a random variable that has distribution 0 and is independent of lU(oo). 
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The service level within d > 0 can be approximated by 


poo 

P[TT(oo) < C A d] « / <1 >iu(m A d)/e(tt) dn 


(2.5) 


where 




[u 


= < 1 > 


p{u - w)y/2npfe{w] 


In many service systems, the fraction of abandoning customers whose actual waiting times exceed a 
short delay is an important measure of the quality of service and customer satisfaction. We refer to 
this fraction as the effective abandonment fraction because it excludes those abandoning customers 
whose effort of waiting is insignificant. For queues in the ED regime, the fraction of abandoning 
customers out of those whose waiting times exceed d> 0 can be approximated by 


P[C < IF(oo) I C A IT(oo) > d] 


/“(I - <^>^{u))fe{u) du 
(l- 0 (d))(l-a>^(d)) • 


( 2 . 6 ) 


Note that when d > w, we cannot use the fluid model by Whitt ( 2006[ ) to estimate this fraction. 
This is because the steady-state virtual waiting time is equal to w in the fluid model, by which the 
estimate of A IT(oo) > d] must be zero for d > w. 

We are also interested in the distribution of the steady-state queue length. For 0 < u < w, the 
probability that a customer who arrived u time units ago is still waiting in the buffer is around 
1 — 0(u). The mean queue length (i.e., the mean number of customers in the buffer) can thus be 
approximated by 

pw 

Q= A(l —0(u))du. (2-7) 

Jo 

Let X{t) be the number of customers in the system at time t, which fluctuates around n -|- g as the 
queue comes into the steady state. A centered and scaled version of X is defined by 

X{t) = —^(X( 7 t) -n-q). 

We refer to X as the diffusion-scaled queue length process. 

Let X(oo) be the number of customers in the steady state and X(oo) be the diffusion-scaled 
version. Theorem]^ in Sectionimplies that X((X)) approximately follows a Gaussian distribution 
with mean zero and variance 

~2 + pet + P - 1) , PP r CM W A , r 1 

a^ = - ^ ' -^-/ 0 (u)(l - 0 (n)) dn H-/ (1 - 0 (u)) du, 

2p^-ffe{w) 7 Jo 7 Jo 


i.e.. 


P[A(oo) > a] Ri 1 — <L( — ) for a G M. ( 2 . 8 ) 

V (Jx ' 

Hence, the steady-state number of customers approximately follows a Gaussian distribution with 


7 

















mean n + q and variance 

^2 ^ ^ —ll + np/i/ Q{u){l-Q{u))du + npncl [ {I - e{u))'^ du. (2.9) 

2p^fe{w) Jo Jo 

3 Limit theorems 


We present the underlying theorems of the diffusion model in this section. To formulate the ED 
regime, let us consider a sequence of G/GI/n + GI queues indexed by the number of servers. We do 
not require the arrival processes to be renewal, but simultaneous arrival of two or more customers 
is not allowed. In each queue, the number of initial customers, the arrival process, the sequence of 
service times, and the sequence of patience times are mutually independent. All these queues have 
the same traffic intensity p > 1 and the same service time distribution. Since the service rate p 
is identical in all systems, the arrival rate of the nth system is \n = npp. Assume that the mean 
patience time goes to infinity as n goes large, i.e., 

7 n —>■ oo as n —)• oo. (3-1) 

We do not require any assumption on the increasing rate of 7 ^ towards infinity. Because the 
patience time distribution changes with n, it is necessary to define a normalized patience time 
distribution for all queues. Let H be the distribution function of a nonnegative random variable 
with mean one, i.e., f^udH(u) = 1. Assume that ff is absolutely continuous with a bounded, 
strictly positive density function fn, i.e., there exists some k > 0 such that 

0 < fniu) < K for u > 0. (3-2) 


Using this normalized distribution, we define the distribution function of patience times in the nth 
system by 

Qn{u) = H(—'] for n > 0. (3-3) 


Glearly, the mean patience time in the nth system is equal to 7 ^. By (2.2), the mean virtual waiting 
time in the nth system is Wn = 0^^((p — l)/p). We may thus define the normalized mean virtual 
waiting time by 


which satisfies w = Wnl^n for all n G N. 

Let F be the distribution function of service times. We impose a mild regularity condition on 
E, which is 

limsup -{F{u) — F{0)) < 00 , (3.4) 


and assume that the third moment of F is finite, i.e.. 



dF{u) 
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< 00 . 


(3.5) 





Let Fe be the equilibrium distribution of F, given by 


Fe{t) = f (1 — F{u)) du for t > 0. 

Jo 

We assign service times to customers according to the following procedure. Let {^j^k ■ G N} 
be a double sequence of independent nonnegative random variables. For each j G N, follows 
distribution Ff, and follows distribution F for k > 2. In the nth system, assume that all n 
servers are busy at time zero. For j = 1,... ,n, is assigned to the initial customer served by 
the jth server as the residual service time at time zero. For k > 2, is the service time of the 
A:th customer served by the jth server. By this assignment, for all j, /c G N, the kth service time by 
the jth server is identical in all systems that have at least j servers. 

Let En{t) be the number of arrivals in the nth system during time interval (0,t]. Dehne the 
diffusion-scaled arrival process En by 

Enify — , {,En{,1'nJJ) 

Let be a renewal process whose interrenewal times have mean one and variance c^. If En is a 
renewal process with En{t) = N{Xnt), it follows from the FCLT for renewal processes that 

En ^ E as n —)• oo, (3-6) 

where .E is a driftless Brownian motion with E{0) =0 and variance pfLC^. To allow for more general 
arrival processes, we take the convergence in ( |3.6[ ) as an assumption rather than require each En 
to be a renewal process. 

Let Wn{t) be the virtual waiting time at t in the nth system, whose diffusion-scaled version is 


Wn{t) 



{Wni'ynt) - 'JnW). 


Assume that there exists a random variable W (0) such that 


Wn{0)^W{0) as n —)■ oo. 


(3.7) 


The first theorem states the diffusion limit for the virtual waiting time process in the ED regime. 

Theorem 1. Assume that the sequence of GjGIjn -|- GI queues described above has a eommon 
traffic intensity p> 1 and satisfies conditions ( |3.1[ )-(3.7). Then, 

Wn W as n ^ oo, 

where W is the OU proeess given by the following stochastic differential equation 


W{t) = W{0)+M{t)-pfH{w) W{u)du fort>0. (3.8) 

Jo 

Here, M is a driftless Brownian motion with M(0) = 0 and variance (c^ -|- + P — ^)/{pp)- 
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Remark. Let h be the hazard rate function of H, i.e., h{u) = fH{u)/{l — H{u)) for u > 0. Because 
H{w) = {p — l)/p, we have h{w) = p/niw). The diffusion limit depends on the interarrival and 
service time distributions through their first two moments, and depends on the normalized patience 
time distribution through the hazard rate at w. This is because by centering and space-time scaling, 
the arrival and service completion processes are replaced by Brownian motions in the limit process. 
Since the virtual waiting time process fluctuates around the equilibrium level, the influence on the 
scaled abandonment process is mostly dictated by the normalized patience hazard rate at w. 

The mean queue length in the nth system can be computed by 

j-w 

qn= A„(l - 0 „(m)) dn = n 7 n/ 0 /i / {l-H{u))du. (3.9) 

Jo Jo 

Let Xn{t) be the number of customers at time t > 0 in the nth system. Then, the diffusion-scaled 
queue length process is given by 

Xn{t) = -n- qn). (3.10) 


Put 


Wn{t) 


— Wni'^nt), 

In 


(3.11) 


which is the virtual waiting time at t in the nth time-scaled system. For a given t > 0, the second 
theorem concerns the limit of the diffusion-scaled queue length at time t + Wn (t) ■ 

Theorem 2. Under the conditions of Theorem^ for any fixed t >0, 


Xn{t + Wn{t)) ^ pW{t) + G{t) as n —>• oo. 


where W is the OU process defined by (3.8) and G{t) is a Gaussian random variable with mean 
zero and variance 


^9 = 


rw rw 

pp / H{u){l — H{u)) du + ppc^^ / {1 — H{u))‘^ du. 
Jo Jo 


In addition, W{t) and G{t) are mutually independent. 

Remark. Let t go to infinity. Then, pW(t) + G{t) converges in distribution to a Gaussian random 
variable with mean zero and variance 


(Trn - 


Ujc l + pc^,+p-l) 
‘Jp‘^fH{w) 


+ PP H{u){l — H{u)) du + ppc^ / {1 — du. 


When n is large, the distribution of Xn{oo) should be close to this Gaussian distribution, which 
leads to formula (2.8). 


The third theorem plays an essential role in proving the previous two theorems. This theorem 
is an FCLT for the superposition of time-scaled, stationary renewal processes, which are defined as 
follows. For t > 0 and j G N, let 


Nj{t) = max{A: > 0 : H-h < t}, 
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where {^j^k ■ j,k £ N} is the double sequence of random variables defined earlier. If > t, we 
take Nj{t) = 0 by convention. Because follows distribution Fg and ^j^k follows distribution F 
for k >2, {Nj : j G N} is a sequence of iid stationary renewal processes. 


Theorem 3. Let {Nj : j G N} be a sequence of iid stationary renewal processes, i.e., the delay 
distribution F^ of each renewal process is the equilibrium distribution of the interrenewal distribution 
F. Assume that F, having mean l/p, and squared coefficient of variation c^, satisfies (3.4) and 
(3.5). Let 


i=i 


(3.12) 


and {7n : n G N} be a sequence of positive numbers such that jn —t oo as n ^ oo. Then, 


Bn => B as n ^ oo. 


where 


Bn{t) 


1 


(Bnffint) - nHJnt) 


and B is a driftless Brownian motion with B{0) = 0 and variance jac^ 


(3.13) 


Remark. To better understand Theoremj^ let us compare this result with two other FCLTs. By the 
FCLT for renewal processes, {{Ni{lt) — ; t > 0} converges in distribution to a Brownian 

motion as ^ goes to infinity; see, e.g., Theorem 5.11 in Chen and Yao (2001). Clearly, the increments 
of this time-scaled renewal process become independent of its history as the scaling factor gets large. 


Whitt (1985) proved an FCLT for the superposition of stationary renewal processes. It states that 


~ ■ t > 0} converges in distribution to a zero-mean Gaussian process that 

has stationary increments and continuous sample paths. In this FCLT, the superposition process 
is scaled in space only. The covariance function of each stationary renewal process is retained in 


the limit Gaussian process, which, in general, is not a Brownian motion; see Theorem 2 in Whitt 


(1985). In our theorem, each superposition process is scaled in both space and time. Squeezing 
the time scale erases the dependence of the increments of to the history. The limit of these 
space-time scaled superposition processes should thus be a Gaussian process with independent, 
stationary increments and continuous sample paths, which must be a Brownian motion. 


In the ED regime, all servers are nearly always busy, so the service completion process is almost 
identical to the superposition of many renewal processes. Theorem implies that the space-time 
scaled service completion process can be approximated by a Brownian motion, which allows us to 
bypass the analysis of the infinite-dimensional age or residual process in proving a limit process. 
Hence, by zooming out our perspective in both space and time, we may obtain a one-dimensional 
diffusion model for many-server queues with a general service time distribution. 


4 Numerical experiments and discussion 

In this section, we examine the diffusion model by numerical experiments. We first study the 
influence of the mean patience time on the accuracy of approximation, and then use the diffusion 
model to solve a staffing problem. 
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Table 1: Performance estimates for the M/GI/IOO + M queue with /r = 1.0 and p = 1.2; simulation 
results (with 95% confidence intervals) are compared with approximate results (in italics). 


Patience 

Abd. fraction 

Virtual waiting time 
Mean Variance 

Queue length 
Mean Variance 



M/D/IOO + M 

7 = 

1.0 

0.1668 

0.1851 

0.005322 

20.02 

73.11 



±0.000020 

±0.000028 

±0.0000030 

±0.0034 

±0.038 



0.1667 

0.1823 

0.005000 

20.00 

70.00 

7 = 

5.0 

0.1667 

0.9142 

0.02639 

99.99 

364.1 



±0.000021 

±0.00014 

±0.00042 

±0.017 

±4.3 



0.1667 

0.9116 

0.02500 

100.0 

350.0 

7 = 

10 

0.1667 

1.826 

0.05487 

200.0 

749.2 



±0.000021 

±0.00030 

±0.000086 

±0.035 

±1.2 



0.1667 

1.823 

0.05000 

200.0 

700.0 



M/E 2 /IOO ± M 

7 = 

1.0 

0.1672 

0.1869 

0.007799 

20.07 

97.08 



±0.000040 

±0.000055 

±0.0000033 

±0.0062 

±0.041 



0.1667 

0.1823 

0.007500 

20.00 

95.00 

7 = 

5.0 

0.1666 

0.9152 

0.03812 

99.97 

481.0 



±0.000043 

±0.00031 

±0.000049 

±0.035 

±0.63 



0.1667 

0.9116 

0.03750 

100.0 

475.0 

7 = 

10 

0.1666 

1.827 

0.07567 

199.9 

956.4 



±0.000042 

±0.00058 

±0.00015 

±0.066 

±2.0 



0.1667 

1.823 

0.07500 

200.0 

950.0 



M/LN/100 ± M 

7 = 

1.0 

0.1679 

0.1890 

0.01049 

20.14 

122.2 



±0.000040 

±0.000052 

±0.0000043 

±0.0055 

±0.049 



0.1667 

0.1823 

0.01500 

20.00 

170.0 

7 = 

5.0 

0.1666 

0.9178 

0.06474 

99.97 

745.5 



±0.000043 

±0.00027 

±0.000068 

±0.029 

±0.73 



0.1667 

0.9116 

0.07500 

100.0 

850.0 

7 = 

10 

0.1666 

1.829 

0.1365 

199.9 

1563 



±0.000042 

±0.00054 

±0.00019 

±0.057 

±1.9 



0.1667 

1.823 

0.1500 

200.0 

1700 


4.1 Influence of the mean patience time 


Serving as the respective scaling factors in space and time, the number of servers and the mean 
patience time will affect how close the queue’s performance is to the diffusion approximation. By 
Theorem both scaling factors are required to approach infinity in order for the diffusion-scaled 
virtual waiting time process to converge. A diffusion model can generally produce satisfactory 


performance approximations when there are at least tens of servers (see, e.g., Garnett et al. ( 2002 ), 


Dai and He (2013), and Huang et al. (2014)). However, it is not immediately clear how large the 


mean patience time should be in order for the proposed diffusion model to be sufficiently accurate. 
We would thus evaluate the influence of the mean patience time on the accuracy of approximation. 

Assume that the queue has a Poisson arrival process with rate A = 120 and 100 servers with 
service rate p = 1.0, so the traffic intensity is p = 1.2. The patience time distribution is exponential 
with mean 7 = 1.0, 5.0, or 10. The service time distribution may be deterministic. Erlang (with 
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Table 2: Tail probabilities for the steady-state virtual waiting time and queue length in the 
M/GI/100 -|- M queue with /x = 1.0 and p = 1.2; simulation results (with 95% confidence intervals) 
are compared with diffusion approximations (in italics). 


Patience 

a = 0.5 

P[W(cx3) > a] 
a = 1.0 

a = 2.0 

a = 0.5 

P[X(oo) > a 
a = 1.0 

a = 2.0 



M/D/100-hM 

7 = 

1.0 

0.2584 

±0.00014 

0.09269 

±0.000078 

0.003869 

±0.000018 

0.2559 

±0.00014 

0.1131 

±0.000089 

0.01140 

±0.000031 

7 = 

5.0 

0.2505 

±0.0019 

0.08689 

±0.0016 

0.003138 

±0.00017 

0.2707 

±0.0013 

0.1200 

±0.0013 

0.01120 

±0.00031 

7 = 

10 

0.2539 

±0.00046 

0.09004 

±0.00023 

0.003419 

±0.000050 

0.2840 

±0.00049 

0.1252 

±0.00029 

0.01089 

±0.000093 



0.2398 

0.07865 

0.002339 

0.2750 

0.1160 

O.OO8414 



M/E2/100±M 

7 = 

1.0 

0.3007 

±0.00023 

0.1422 

±0.00015 

0.01596 

±0.000039 

0.2865 

±0.00023 

0.1472 

±0.00015 

0.02314 

±0.000044 

7 = 

5.0 

0.2884 

±0.00055 

0.1302 

±0.00039 

0.01215 

±0.000098 

0.2972 

±0.00056 

0.1523 

±0.00041 

0.02261 

±0.00014 

7 = 

10 

0.2859 

±0.00073 

0.1279 

±0.00054 

0.01151 

±0.00014 

0.3057 

±0.00074 

0.1538 

±0.00059 

0.02095 

±0.00021 



0.2819 

0.1241 

0.01046 

0.3040 

0.1525 

0.02009 



M/LN/100±M 

7 = 

1.0 

0.3288 

±0.00019 

0.1826 

±0.00014 

0.03577 

±0.000048 

0.3099 

±0.00017 

0.1774 

±0.00013 

0.03847 

±0.000048 

7 = 

5.0 

0.3348 

±0.00036 

0.1952 

±0.00026 

0.04389 

±0.00016 

0.3343 

±0.00035 

0.2040 

±0.00026 

0.05275 

±0.00016 

7 = 

10 

0.3371 

±0.00049 

0.1997 

±0.00035 

0.04703 

±0.00024 

0.3452 

±0.00049 

0.2118 

±0.00040 

0.05492 

±0.00024 



0.3415 

0.2071 

0.05124 

0.3507 

0.2216 

0.06252 


two stages), or log-normal, denoted by D, E 2 , and LN, respectively. With = 0 and 0.5, the 
deterministic and Erlang distributions are used to represent scenarios where service times have 
small variability. Brown et al. (2005) reported that a log-normal distribution provides a good fit 
for service time data from an Israeli call center. We also test such a distribution with = 2.0, 
which yields more variable service times. 

The estimates of several performance measures, including the fraction of abandoning customers, 
the mean and variance of the steady-state virtual waiting time, and the mean and variance of the 
steady-state queue length, are listed in Table We use (2.1), (2.2), (2.4), (2.7), and (2.9) to 
compute the respective approximate results. The formulas for the fraction of abandoning customers, 
the mean virtual waiting time, and the mean queue length are identical to those obtained from the 
fluid model by Whitt (2006). These fluid approximations agree with the simulation results very 
well, which is consistent with the conclusion drawn by Whitt (2006): The fluid model is able to 
produce accurate approximations for mean performance measures in the ED regime. 

The fluid model, however, cannot be used for estimating variances because of its deterministic 
nature. We need the diffusion model for completing this task. Theorems and imply that 
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Customer number 



(a) 7 = 1.0 


(b) 7 = 5.0 


Figure 1: The steady-state distribution of the number of customers in the M/H 2 /IOO -|- M queue 
with /i = 1.0, p = 1.2, and = 3.0; the exact distribution by the matrix-analytic method is 
compared with Gaussian approximation in (4.1). 


diffusion approximations are more accurate when the mean patience time is longer. Comparing the 
variance results in Table however, we can see that an adequate diffusion approximation may not 
require the mean patience time to be large. With a mean patience time that is comparable to the 
mean service time, the approximate variances are satisfactory when service times are deterministic 
or follow an Erlang distribution. This observation can be explained as follows. Because all servers 
are almost always busy, the service completion process is close to the superposition of iid renewal 
processes. As we discussed in the previous section, by squeezing the time scale, the increments 
of the service completion process will become more and more independent of the history. Then 
by Theorem we may use a Brownian motion to replace the space-time scaled service departure 
process to obtain the diffusion model. If the variability in service times is not significant, a moderate 
scaling factor in time is sufficient for the Brownian approximation to work well. Therefore, with 
a deterministic or Erlang service time distribution, the approximate variances are close to the 
simulation results even for 7 = 1.0. A larger scaling factor is necessary if the variability in service 
times is more considerable. When the service time distribution is log-normal with = 2.0, the 
approximate variances are no longer accurate for 7 = 1.0. In order for the increments of the time- 
scaled service completion process to be sufficiently independent of the history, the mean patience 
time should be at least several times longer than the mean service time. We can see that the 
approximate variances are satisfactory for 7 = 5.0 and 10. 

We also compute tail probabilities for the steady-state distributions of diffusion-scaled virtual 
waiting time and queue length. Approximations by (2.3) and ( 2 . 8 ) are compared with simulation 
results in Table the observation from which is consistent with what we found from Table With 
the deterministic or Erlang service time distribution, the approximate distributions are satisfactory 
when the mean patience time is comparable to or longer than the mean service time; when service 
times have a larger variance, the mean patience time needs to be at least several times longer than 
the mean service time, in order for Gaussian approximations to be accurate. 

To illustrate how the number of customers converges to a Gaussian random variable in the 
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Figure 2: The hazard rate of the hyperexponential distribution. 


steady state, we examine an M/H 2 /IOO + M queue whose service times follow a hyperexponential 
distribution with p = 1.0 and Cg = 3.0. There are two types of customers in the queue, and the 
service times of either type are iid exponential random variables. The fraction of the first type is 
59.16% and its mean service time is 0.1691; the fraction of the second type is 40.84% and its mean 
service time is 2.203. We are interested in this queue because the exact distribution of the steady- 


state number of customers can be computed by the matrix-analytic method (see, e.g., Latouche 


and Ramaswami (1999)). We may also approximate the distribution of X(oo) by 


P[X(oo) = i] « ——-—-) for i = 0 , 1 , 

CT 7 ’ V CT 7 ' / 


(4.1) 


where 0 is the standard Gaussian density function and q and ax are given by (2.7) and (2.9), re¬ 


spectively. We compare these two distributions in Figure [T] Although the Gaussian approximation 
cannot capture the exact distribution for 7 = 1.0, it becomes a good fit for 7 = 5.0. 

Through the above numerical examples, we can tell that both the variability in service times 
and the ratio of the mean patience time to the mean service time have influence on the accuracy 
of approximation. We would thus introduce the following quantity 

„ Cs 

-Rsvp = —, 

7P 

or the ratio of the standard deviation of service times to the mean patience time, as an index of 
accuracy when a queue is approximated by the diffusion model. This quantity is called the service- 
variability-to-patience ratio (SVPR). (Essentially, the SVPR is an index of how close the space-time 
scaled service completion process is to a Brownian motion.) By extensive numerical experiments, 
we observe that the diffusion approximations are generally accurate when i?svp <0.5. 


4.2 Staffing using the diffusion model 

In typical service-oriented call centers, the variability in customer service times is not significant and 
most customers are relatively patient when they are waiting for service. For example, by analyzing 


a set of operational data from an Israeli call center, Mandelbaum et al. (2001) reported that the 
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Figure 3: The service level (the percentage of customers who receive service within 120 seconds) 
in the M/LN/n + H 2 queue with A = 1.0, /r = 1/230, and the hyperexponential patience time 
distribution illustrat ed in Figure simulation results are compared with the estimates by (2.5) 
and the estimates by Zeltyn and Mandelbaum (2005). 


customer service times had moderate variability, ranging from = 1.54 to = 5.83 for different 
months, and that the mean patience time was about four times longer than the mean service time. 
Both moderate service variability and long customer patience suggest that the diffusion model is 
appropriate for modeling call centers in the ED regime. Next, let us study an application of the 
diffusion model to call center staffing. 

Consider an M/LN/n+ H 2 queue, which has a Poisson arrival process, a log-normal service time 
distribution, and a hyperexponential patience time distribution. There are two types of customers 
with different abandonment behaviors, and the patience times of either type are iid exponential 
random variables. We assume that 98% of customers have long patience times with mean 1000 
seconds and 2% of customers have short patience times with mean 6.0 seconds. We use such a 
distribution to represent a typical pattern of abandonment in call centers: While most customers 
would wait patiently for their service, a small fraction of customers would hang up within seconds 
if they cannot be served immediately. The hazard rate function of patience times is plotted in 
Figure [2] We adjusted the parameters in order for the hyperexponential distribution to imitate 
the patience time distribution in the call center of a large U.S. bank, whose operational data were 
analyzed by Mandelbaum and Zeltyn (2013). By checking Figure 2 in their paper, one can see that 
our hazard rate is just a “replica” of the smoothed hazard rate of their patience time data. 

In this queue, the service rate is taken to be /U = 1/230 per second, which is obtained from 


the estimated mean service time in the U.S. call center; see Figure 16 in Mandelbaum and Zeltyn 


(2013). Their paper, however, does not provide information about the variance of service time. 
We would consider two scenarios where the log-normal service time distribution has = 3.0 and 
= 5.0, respectively. 

Let the customer arrival rate be fixed at A = 1.0 per second. We would like to determine the 
minimum number of servers that is required for at least 80% of customers to receive service within 
120 seconds. The given delay in this example is around one half of the mean service time, which 
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Figure 4: The effective abandonment fraction (the fraction of abandoning customers out of those 
whose waiting times exceed 60 seconds) in the M/LN/n + H 2 queue with A = 1.0, = 1/230, 

and the hyperexponential patience time distribution illustrated in Figure simulation results are 
compared with the estimates by (2.6) and the estimates by Zeltyn and Mandelbaum (2005). 


is a reasonable requirement because customer waiting times should be comparable to service times 
in order for the queue to be in the ED regime. For any fixed number of servers, the service level 
within the given delay can be estimated by (2.5). We compare the service level estimates with 
simulation results in Figure and find good agreement in both scenarios. To reach the service 
level objective, the estimates from the diffusion model recommend 211 servers for = 3.0 and 213 
servers for = 5.0, while the simulation results recommend 211 and 212 servers, respectively. It 
can be seen that the estimation error is more apparent when = 5.0, which is consistent with the 
fact that this scenario has a larger SVPR. 


An existing method is also considered for the purpose of comparison. Zeltyn and Mandelbaum 


(2005) modeled a call center as an M/M/n + GI queue and studied performance approximations 


for this model. Their results are based on the analysis of a Markov chain studied by Baccelli and 


Hebuterne (1981), and some results are used for call center staffing in Mandelbaum and Zeltyn 


( |2009 ). The approximate distribution of the steady-state virtual waiting time can be found in 
(6.12) in their paper. This approximation turns out to be identical to our Gaussian approximation 
if we take = 1.0 in (2.3), so we may still use (2.5) to estimate the service level by their 

approximation. The resulting service level estimates are marked with “Z-M” in Figure 

The approximate results by Zeltyn and Mandelbaum (2005) are based on the assumption that 
both interarrival times and service times are exponentially distributed. Their estimates may not 
be accurate when these distributions are non-exponential. We find a wide discrepancy in Figure 
between the simulation results and their service level estimates. Because their approximation 
assumes = 1.0, the estimates suggest 208 servers in both scenarios. If this recommendation is 
adopted, the actual service level will be 74.4% for = 3.0 and 70.9% for = 5.0; in neither 
case can the service level objective be fulhlled. When the variability in arrival and service times 
is relatively large, one tends to underestimate the staffing level using the M/M/n -|- GI model, in 
which case the required service level can hardly be achieved. 
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The fraction of abandoning customers is also a common concern in call center staffing. This time 
we would like to determine the minimum number of servers in the M/LN/n + H 2 queue, such that 
out of those whose waiting times exceed 60 seconds, the fraction of abandoning customers is less than 
5%. Excluding customers abandoning the system within a short delay, this effective abandonment 
fraction is used more widely as a measure of customer satisfaction than the total percentage of 
abandoning customers. Figure compares the respective estimates by (2.6) and by Zeltyn and 


Mandelbaum (2005) with simulation results. As in the previous example, the estimates by (2.6) 


agree well with the simulation results, whereas the results by Zeltyn and Mandelbaum (2005) cannot 
capture the actual fractions. To achieve the above target, the simulation results recommend 205 
servers for = 3.0 and 206 servers for = 5.0, and the estimates by (2.6) suggest 205 and 207 
servers, respectively. In contrast, the results by Zeltyn and Mandelbaum (2005) yield 202 servers 
in both scenarios. If one follows this recommendation, the effective abandonment fraction will be 
5.55% for = 3.0 and 6.15% for = 5.0, failing to reach the performance objective in both cases. 


5 Diffusion limit of virtual waiting time 

This section is dedicated to the proof of Theorem A sequence of perturbed systems is introduced 


in Section 5.1 In Section 5.2, we first show that the perturbed systems are asymptotically equivalent 


to the original queues, then prove the diffusion limit for virtual waiting time processes in the 
perturbed systems, and finally finish the proof of the theorem by using the asymptotic equivalence. 
The proof procedure in Section 5.2 partially follows that of Theorem 1 in Huang et al. (2014). 


5.1 Perturbed systems 

Consider the virtual waiting time process in the nth queue. Let Wn,k be the offered waiting time of 
the fcth customer arriving after time zero, which is the amount of time the feth customer would wait 
until getting into service, provided that his patience is infinite. If we use (n,k to denote the patience 
time of the feth customer, the number of customers who arrived during (0, t] but will eventually 
abandon the system can be counted by 


E„(t) 

Ln{t) — 'y ^ ^{C,n,k<Wn,k}' 

k=l 

Suppose that all customers arriving after time t are rejected immediately. Then, the virtual waiting 
time at t turns out to be the amount of time from t until an idle server appears, i.e.. 


Wn{f) = inf{u > 0 : A„(0) + En{t) - Ln{t) - + n) < n}. 


(5.1) 


where Dn{t) is the number of service completions during (0,t]. Let an,k be the arrival time of the 
kth. customer. Because no two customers arrive at the same time, it follows from Lemma 3.2 in 


Dai and He (2010) that 


Wn,k — W^n{0‘n,k )■ 

All servers are almost always busy in the ED regime. For j = 1,..., n, because {fj,k ■ k G N} is 
the sequence of service times to be finished by the jth server, the service completion process from 
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this server is identical to the renewal process Nj until it begins to idle. Therefore, the departure 
process Dn is identical to the superposition of A^i,..., iV„ until the first idle server appears. Let 


Tn = inf{t > 0 : Wn{t) = 0} 

be the time that the first idle server appears. Then, > 0 because all servers are busy at time 
zero. The departure process satishes 

Dnit) = Bn{t) for 0 < t < Tn, 


where Bn{t) is given by (3.12). As the superposition of n iid stationary renewal processes. Bn is 
more analytically tractable than Dn- The equivalence between these two processes up to time Tn 
allows us to explore a perturbed system that has simplified dynamics. This perturbed system is 
asymptotically equivalent to the original queue as n goes large. 

Because Wn{t) > 0 for 0 < t < r^,, it follows from (5.1) that with probability one, 

A„(0) + En{t) - Ln{t) - Dn{t + Wn{t)) = n - 1 for 0 < t < T^. 

Because Dn{t + u) = Bn{t + u) for 0 < t and 0 < u < Wn{t), we further have 

Xn{0) + En{t) - Ln{t) - Bn{t + Wn{t)) = n - 1 for 0 < t < T^. 

Let us introduce a new process Vn by using a slightly modified dynamical equation 

A„(0) + En{t) - Rn{t) - Bn{t + Vn{t)) = n - 1 for t > 0, 

where 

En{t) 

Bn{t) ^ ^ ^{Cn.fe<KL(cin,*; —)} ■ 
k=l 


(5.2) 


By a standard sample path argument, one can show that (5.2) has a unique solution Vn- Clearly, 

(5.3) 


Wnit) = Vn{t) for 0 < t < Tn 

on each sample path, and Tn can thus be defined alternatively by 

Tn = inf{t > 0 : Vn{t) = 0}. 


(5.4) 


Taking t = 0 in (5.2), we obtain Bn{Vn{0)) = Xn{0) — (n — 1), which allows us to write (5.2) into 

Bnit + L)i(t)) = Bn{Vn{0)) + En{t) — Rnit)- (5-5) 

We refer to Vn as the virtual waiting time process in the nth perturbed system- The dynamical 
equation (5.5) is identical to equation (9) in Huang et al. (2014), while the latter is derived for their 
original system. 

The perturbed system can be envisioned as a queue where no server is allowed to idle. If a 
server finds no waiting customers upon a service completion, she begins to serve a customer who 
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has not arrived yet. In the perturbed system, all servers are always busy and the departure process 
from each server is a stationary renewal process. 


5.2 Asymptotic equivalence 


We first prove a fluid limit for the virtual waiting time processes in the perturbed systems. This limit 
allows us to establish the asymptotic equivalence between the original queues and the perturbed 
systems, which implies that these two sequences of systems have an identical diffusion limit. 

Lemma is a modified version of Theorem 4.1 in Pang et al. (2007), stating the continuity of a 
map defined by an integral equation. This map will be used extensively in the subsequent proofs. 


Lemma 1. For any / G D, let x be a function in D such that 


x{t) = f{t) - [ g{x{u))du, 
Jo 


(5.6) 


where : M —> 

X G D such that (|5.6|) holds. Let -0 


is Lipsehitz continuous with g{0) = 0. Then for each / G O, there is a unique 
^ D 6e the function that maps f to x. Then, ip is a 


continuous map when D ("as both the domain and the range) is endowed with the Ji topology. 


Write the dynamical equation (5.5) into the fluid-scaled form 


Bn{t + Vn{t)) — i?„(W(0)) + En{t) — Rnit), 


(5.7) 


where 


En{t) = —Enijnt), Bn{t) = —Bn{jnt), Rn{t) = —Rnilnt), Vn{t) = —I4(7ni)- (5.8) 

n-ln n'Jn njn In 


By (3.6), the fluid-scaled arrival process satisfies 

E„ 


ppe as n —>■ oo. 


(5.9) 


As a scaled version of the superposition of renewal processes, B^ satisfies a functional strong law of 
large numbers (FSLLN), as is stated in the proposition below. We leave the proof to the appendix. 

Proposition 1. Under the conditions of Theorem^^ 

Bn —>■ pe almost surely as n ^ oo. 


The following result states that in these perturbed systems, both the fluid-scaled virtual waiting 
time processes and their total variations are stochastically bounded. 

Lemma 2. Under the eonditions of Theorem^ 


and 


lim limsupP sup {t + Vn{t)) > a 
a^oo n^oo ^ 0<t<T 


= 0 


(5.10) 


lim limsupP 

a—>-oo n^oo 



|dl4(t)| > a 


0 for T > 0. 


(5.11) 
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The next lemma establishes an asymptotic relationship in the fluid scaling, allowing us to 
approximate the abandonment process by an integral of the virtual waiting time process. 

Lemma 3. Under the conditions of Theorem^ 

rt 


sup 

0<t<T 


Rn{t)-pfl H{Vn{u))du 

Jo 


0 for T > 0. 


Now we can prove the fluid limit for the virtual waiting time processes in the perturbed systems. 
Proposition 2. Under the conditions of Theorem\^ 

Vn => wx as n ^ oo. 


Proof. By (5.7), we obtain 


Vn{t) = t4(0) + In{t) + {p-l)t-p f H{Vn{u)) du, 

Jo 


where 


In{t) = -{BniVniO)) - pVn{0)) - -{Bnit + 14(t)) - p{t + I4(i))) + “ PPt) 

p p p 

--(Rn{t)-pp[ H{Vn{u))du). 

Jo ^ 


Then, Vn = '0(14(O)x + /„ + (p — l)e), where ip is the map defined by (5.6) with g[u) = pH{u) 
for tt G M. By (3.2), H is Lipschitz continuous with H{0) = 0, so 'i/’ is a continuous map. Because 
wx = Vi'diX {p ~ l)c)) Lemma[^and the continuous mapping theorem (see, e.g.. Theorem 5.2 in 


Chen and Yao (2001)) will lead to the assertion once we prove V^(0) w and In ^ 0 as n ^ oo. 


The convergence of Ki(0) follows from (3.7) and (5.3), and the convergence of In follows from 
( |5.9[ ), Proposition!^ and Lemmas [TJ|^ □ 

The fluid limit allows us to establish the asymptotic equivalence between the perturbed systems 
and the original queues. Put fn = Tnixn, which is the instant when the first idle server appears 
in the nth time-scaled perturbed system. The next proposition states that fn goes to infinity in 
probability as n increases. 

Proposition 3. Under the conditions of Theorem^ 

lim Pl-fn < T] = 0 for all T > 0. 

n^oo 


Proof. By (5.4) and (5.8), = inf{t > 0 : Vn{t) = 0}, which yields 


\fn<T]=F inf Vn{t) = 0 

lo<t<T 


Then, the assertion follows from Proposition!^ 


□ 
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Remark. By (3.11), (5.3), and Propositionsandwe obtain 

Wn wx as n —>■ oo, 


(5.12) 


which is the fluid limit for the virtual waiting time processes in the original queues. 

The diffusion-scaled virtual waiting time process in the nth perturbed system is defined by 


— \j 'Jn'w'). 

In 


(5.13) 


Then, 


Wn{t) = Vn{t) for 0 < t < Tn- (5-14) 

Proposition implies that Wn and Vn are asymptotically equal over any finite time interval. This 
allows us to consider the diffusion limit of Vn in order to obtain that of Wn- 

For notational convenience, write an,k = o,n,klln and Cn,kMn, which are the scaled arrival and 
patience times of the A:th customer. Put 

^ En{p(n^') 

^n{t) = ^ (l{Cn,fe<V„(a„,fc-)} - 0n(14(an,fc-)))- 

V k=l 


By (3.13), (5.8), and (5.13), we can write (5.5) into the diffusion-scaled form 


where 


and 


Vn{t) = Vn{0) + Mnit) - T„(t) - pfniw) / K(n) du, 


Mn{t) = ^{Bn{Vnm-Bn{t + Vn{t))-Rn{t)) + —En{t) 




(5.15) 

(5.16) 


Tn(t) = —^ {H{Vn{an,k-))-H{w))-pfH{w) Vn{u)du. (5.17) 

Jo 


Equations (5.15)-(5.17) follow equations (16)-(17) in Huang et al. (2014). 

To obtain the limit process of 1^, we should first obtain the limit processes of and T„. The 
convergence of can be deduced by the next lemma, which states the joint convergence of the 
diffusion-scaled arrival, service completion, and abandonment processes in the perturbed systems. 

Lemma 4. Under the conditions of Theorem^ 

(.Bn: Bn: Bn) — (.B: B: If) aS Tl ^ OO, 


where E, B, and R are independent driftless Brownian motions with variances ppc\, pel, and 
{p — l)p/p, respectively. 

The process turns out to be an error term converging to zero. 
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Lemma 5. Under the conditions of Theorem^ 


Tn => 0 os n —>■ oo. 

Using these convergence results, we can obtain the diffusion limit for the perturbed systems. 
Proposition 4. Under the conditions of Theorem^ 

Vn ^ W as n ^ oo, 


where W is the OU process given by (3.8). 


we 


Proof. By (3.7) and (5.14), 14(0) => 1U(0) as n —)• oo. Using Proposition and Lemma 
obtain Mn ^ M as n —?• oo, where M is a driftless Brownian motion with M(0) = 0 and variance 
(c„ + pCg + p — l)/{pp). Then, by Lemma 

Vn{0)x + Mn - T„ lU(0)y + M as n —)■ oo. 


Because 14 = il’{Vn{0)x + Mn — T„) and lU = '0(lU(O)y; + M), where is the map given by (5.6) 
with g{u) = pfH{w)u for tt G M, the assertion follows from Lemma[^and the continuous mapping 
theorem. □ 

Let us complete the proof of Theorem by using the asymptotic equivalence. 


Proof of Theorem\^ By (5.14), 


sup \Wn{t) — 14(01 > 0 < lP[r„ < T] for T > 0. 
0<t<T 


Then, Proposition [^implies that Wn — 14^0asn—)-oo. The theorem follows from Proposition]^ 
and the convergence-together theorem (see Theorem 5.4 in Chen and Yao (2001|)). □ 


6 Gaussian limit of queue length 


We prove Theorem in this section. With a general patience time distribution, it is difficult to 
prove a diffusion limit for the queue length process in the ED regime. Huang et al. (2014) proved 
a one-dimensional limit process of queue length process for the GI/M/n-|-GI model with a general 
scaling of patience-time distributions; they used the diffusion limit of virtual waiting time to obtain 
the limit queue length process. In the present study, we consider the marginal distribution of 
the queue length at a particular time, which also allows us to infer the steady-state queue length 
distribution. 


Let Qnit) = (Xnit) — n)~^ be the queue length in the nth system at time t. Because customers 
are first-come first-served, those who arrived before t must have either entered service or abandoned 
the system by t + Wn{t). In other words, customers who are waiting at t + Wn{t) must have arrived 
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during (i, t + Wn{t)] and have not abandoned the system by t + Wn{t). This observation yields 


Qn{t + Wn{t))= ^ l{Cn,fc>i+Vl^n(t)-a„,fe}- 

h=ErL 


For 0 < t < Tn, because Qn{t + Wn{t)) = Xn{t + Wn{t)) — n, it follows from (3.10) and (3.11) that 


Xnit + Wnit)) = _ {Qnilnt + Wn{'lnt)) “ Q'n), 


where qn is given by (3.9). Then, 

Eni'ynt+Wni'ynt)) 


Xn{t + Wn{t)) = 


1 


l{Cn,fe>7nt+W„(7nt)-a„,fe} - / {l-H{u))du. 

V fc=E„(7„t)+l 


By (5.8), we can further decompose the scaled queue length into 

Xn{t + Wn{t)) = ^Wn{t) + G'^{t) + &^{t) + Yn{t) + 


( 6 . 1 ) 


where 


rt-\-w rt+w 

G'nit) = yjn'yn / [I-H{t + w-u))dEn{u) - y/nrp^p^i I {I - H{t + w - u)) du, 

Jt Jt 


1 


En(nnt+W„(^nt)) 


^ {H{t + Wn{t) 


0-n,k) l{C„,fc<i+W„(t)-a„,fe})> 


G:;(t)= ^ 

= En{t + Wn{t)) - En{t + w) - / H{t + Wn{t) - u) dEn{u) 

J t+w 

pw 

+ {p- l)pWn{t) - y/nj^pp / [H{Wn{t) - u) - H{w - u)) du, 

h 

pt-\-w 

YLit) = ^ 

It 

[H{t + w- u) - H{t + Wn{t) - u)) du. 


{H{t + w - u) - H{t + Wn{t) - u)) dEn{u) 


To obtain the limit distribution of X^it + VFn(t)), we need to analyze each term on the right 
side of (6.1). The joint convergence of the first three terms is given by Lemma 


Lemma 6. Under the conditions of Theorem^ for any fixed t > 0, 

(lT„(t), G'Jt), G'f{t)) ^ {W{t), Gft), G"{t)) asn^oo, 


24 




















where W is the OU process given by (3.8) 


{1 — H{t + w — u)) dE{u), 

and 

pW 

G"{t)= / pfiH{w — u){l — H{w — u)) d5 {u). 

Jo 

Here, S is a standard Brownian motion that is independent of E, B, and {W{u) : 0 <u <t}. 

The other two random variables Yn{t) and Yf(t) are error terms converging to zero. 

Lemma 7. Under the conditions of Theorem^ for any fixed t >0, 

{Ynit),Yf{t)) ^ (0,0) as n- 1 - 00 . 

Using the previous two lemmas, we are ready to prove Theorem 
Proof of Theorem^ Put Gn{t) = + G'f{t) + Yn{t) + Yf{t). By Lemmasand 

{Wnit),Gnit)) ^ {W{t),G{t)) asn^oo, 

from which the convergence of Xn{t + Wn{t)) follows. □ 


7 Conclusion and future work 


A diffusion model was proposed for GI/GI/n + GI queues in the ED regime. We adopted a space- 
time scaling approach, in which the number of servers and the mean patience time are used as the 
respective scaling factors in space and time, to obtaining a one-dimensional diffusion approxima¬ 
tion for the virtual waiting time process. Using this diffusion model, we derived the steady-state 
distributions of virtual waiting time and queue length, along with approximate formulas for other 
performance measures. These approximations are generally accurate when the mean patience time 
is several times longer than the mean service time and the patience time distribution does not 
change rapidly around the mean virtual waiting time. 

One limitation of the diffusion model is as follows. As we discussed in Section the diffusion 


limit given by (3.8) depends on the normalized patience time distribution only through the hazard 


rate at w. In consequence, all approximate formulas derived from the diffusion model are dictated 
by the patience time hazard rate (or equivalently, the patience time probability density) at the mean 
virtual waiting time. These approximations will generally be accurate, if the patience time hazard 
rate does not change much around the mean virtual waiting time (as the case in Figure]^. In call 
centers, this assumption would be valid if waiting customers do not receive real-time information 
about the queue. However, if delay announcements are made in the call center, customers may 
decide whether to hang up according to what they hear about the queue. As a result, the patience 
time hazard rate may change rapidly after an announcement time. In this case, performance 
approximations that depend on the patience time distribution only through “a single point” may 
no longer produce satisfactory results. We would thus need a diffusion model that could incorporate 
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the patience time hazard rate on a neighborhood of the mean virtual waiting time. Such models 


were derived for GI/M/n + GI queues by Reed and Tezcan (2012) in the critically loaded regime 


and by Huang et al. (2014) in the ED regime. Furthermore, Huang et al. (2014) applied the 


results to the analysis of systems with delay announcement. In the future, we would also extend 
our diffusion model by including the entire patience time distribution. To this end, the current 
asymptotic framework needs to be modified so that both the space-time scaling and the hazard 


rate scaling used by Reed and Tezcan (2012) and Huang et al. (2014) can be combined into the 


same asymptotic framework. 


Appendix 


We prove Theorem is in Section |A.1| and prove all technical lemmas in Sections A.2 and A.3 


A.l Proof of the FCLT 

Let Sj^k be the /cth partial sum of f G N}, i.e., 

k 




e=i 


We take = 0 by convention. Let us first present the proof of Proposition 
Proof of Proposition^^ Because <t< 5'j Ar^(t)+i for f > 0, then 


Z]j=l ^j,Nj('ynt) 


< 


n^nt 


< 




E ■=! E- ilnt) - E •=! E- {int) ^ E'=1 E' ilnt) 
provided that Ej=i > 0. Note that 

n n n 

y^ o,*:- 

j=l j=\ j=l k=2 

Because lim^^oo = oo almost surely for f > 0, then 

"vn 


j=l 2^k=2 _ J- 


lim 

E,=lE'(7nt) 




almost surely by the strong law of large numbers. In addition, lim„_^oo Ej^=i ^j{lnt)/n = oo almost 
surely for f > 0, which implies that 


Therefore, 


lim " ^ = 0 

™ E]=lN,hnt) 

Ej=l ^j,Nj{'y„t)+l _ I 


lim 


n^oo Y3i=l^jilnt) IJ.' 
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Also, 


lini g=i = lim 


^ Nj{-fnt) n^oo j ijut) - 1 ) 


Ylj=l ^j,Nj{-/nt) Ylj=li^ji'lnt) 1 ) _ ^ 

ELl^i(7ni) /^' 


Then, linin^oo Bn{t) = fit almost surely for t > 0. Because Bn{t) is nondecreasing in t and e is a 
continuous function, the assertion follows from Theorem VI.2.15 in Jacod and Shiryaev (2002). □ 

Lemma 8. Let 

fort>0. (A.l) 

“ j=l k=2 


Under the conditions of Theorem^ 


BL 


B as n —)• oo. 


Proof. Let : k G N} be a sequence of iid random variables following distribution F. Put 


\nint\ 


Kit) = 


X] t > 0. 


k=l 


Because fiff^ has mean one and variance c^, by Donsker’s theorem, B'f =► B" as n —oo, where B" 
is a driftless Brownian motion with B"{U) = 0 and variance c^. By (5.8), 


n-ynBnit) 


NlUnt)^ -|-A''n(7nt) 


KiKit)) = 




E i^-t^^k) 


y/nPff 


E 

k=l 


i^-uQ- 


Then, it follows from Proposition!^ and the random time-change theorem (see Theorem 5.3 in Chen 


and Yao 


(2001)) that B'f ° Bn => fJ^t'^B" as n ^ oo. Because B'.^ has the same distribution as 
B'f o Bn and fi^t'^B" has the same distribution as B, the lemma follows. □ 

The next lemma is a technical result for proving the convergence of Bn- 

Lemma 9. Under the conditions of Theorem\^ for all 0 < r < s < t and all n G N, there exists 
0 < c < oo such that 


E[{Bnis) - Bnir))\Bn{t) - Kis))^] < c{t - rf. 

Proof. Let Nj{u) = Nj{u) — fiu for u > 0 and j = 1,... ,n. Because Nj is a stationary renewal 
process, by inequalities (7) and (8) in Whitt (1985), there exists ci < oo such that 

E[(V,-(s)-iV,(r))2]<ci(s-r) (A.2) 

and 


E[(iV,(s) - Kir))\Kit) - KiK^] <ci{t- rf 


(A.3) 


for all 0 < r < s < t. The regularity condition (3.4) is required for inequality (A.3) to hold. In 
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addition, it follows from (A.2) and Holder’s inequality that 


E[|iVj(s) - Nj{r)\\Nj{t) - iVj(s)|] < ci(s - < ci(t - r). 

Because A^i,..., Nn are iid processes, 


(A.4) 


E[(H„(s) - Bn{r)f{Bn{t) - Ms)f] = ^E[(lVi(7„s) - Ni{jnr))HNi{-/nt) - Ni{7ns)f] 

nii 

+ ^^E[(iVi(7„s) - iVi(7„r))2]E[(iVi(7„f) - iVi(7„s))' 

nX 


+ 


2(n- 1) 

nX 


E[(iVi(7„s) - iVi(7„r))(lVi(7„t) - iVi(7„s))]' 


< ci(t - r)^ + Ci(s - r){t - s) + 2cf (t - r)^, 

in which the inequality is obtained by (A.2)"(A.4). The lemma follows with c = 3cf + ci. □ 

Proof of Theorem\^ For j £ N, let 

Ujit) = - t (A.5) 

be the residual time at t > 0. Note that 17j(0) = fj^i. Because Ni,..., Nn are iid stationary 
renewal processes, Ui{t ),..., Unit) are iid random variables following distribution Ff, for all t > 0, 
each having mean 

TTT-e = / tdFJt) = -^ 

Jo ^ ^ 2^ 


and variance 


r*00 

X = I dFe{t) -mi = U I U dF{t) - mi 
Jo <3 Jo 


II 

,2 _ P / .3 


Then, cig < oo by (3.5). Put 


1 ^ 

Unit) = - 'YjiUjXnt) - m-e), 


i=i 


and we have E[C/„(i)^] = (Tg/ 7 „, —>■ 0 as n —)> oo. This implies that Unit) ^ 0 as n ^ oo for t > 0. 
By Theorem 3.9 in iBillingsley (1999), 


iUniti),... Xniti)) ^ 0 asn^oo 


(A.6) 


for any £ G N and 0 < ti < • • • < By (A.5), 

Ujit)=CjA+ ^ Cj,k-t = UjiO)+ ^ iCj^k-X^) +X^Njit)-t. 
k=2 k=2 


Then, by (3.13) and (A.l), we obtain 

Bnit) = —flUnid) + ^Unit) + B'ni^)' 


(A.7) 
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as n —>■ oo. 


We deduced from (A.6), (A.7), and Lemmaj^that 


Finally, it follows from Lemma and Theorem 13.5 in Billingsley (1999) (with condition (13.13) 
replaced by (13.14)) that B as n —>• oo. □ 


A.2 Proofs of lemmas for Theorem [T] 


Proof of Lemma\^ By (5.7), Bn{t + Vn{t)) < B„(W(0)) + En{T) for 0 < t < T. Because Bn have 
nondecreasing sample paths. 


sup {t + Vn{t)) > a < F[Bn{a) < B„(14(0)) + En{T)] for a > 0. 
0<t<T 


By (3.7), (5.3), and PropositionB„(V)i(0)) fj,w as n —7- oo. Then, ( |5.10[ ) follows from ( |5.9[ ). 

Because e + Vn has nondecreasing sample paths (see Lemma 3.3 in Dai and He (2010)), we have 
{Vn{t2) - Vniti))~^ < {t 2 + 14(^2)) “ (H + 14(ii)) and ( 14 (^ 2 ) - Vnitl))~ < t2 “ H for 0 < ti < t 2 . 
Hence, the total variation of Vn over [0, T] satisfies 


\dVn{t)\<2T + Vn{T)-VniO). 


Then, (5.11) follows from (5.10). 


□ 


Martingale arguments are extensively involved in subsequent proofs. Let us define the associated 
filtrations. In the reth system, let Vn,k be the service time of the A:th customer arriving after time 
zero. Note that Vn,k is not identical to f,n,k, since the latter is the kth service time finished by the 
nth server. To keep track of the history of the queue, we define a filtration {J4,i : i = 0,1,...} by 

En,i — (^{0‘n,k+l^ '^n,k : Cn,k ■ k ^ i}, 


where an,k and (n,k are the arrival and patience times of the kth customer. By Lemma 3.1 in Dai 


and He (2010), Vn{an,k—) is J4,i-nieasurable for all A; < i + 1. Modifying the above filtration, we 
construct a continuous-time filtration {J4(t) : t > 0} by 


Fn{t) = 


n,[n7„q • 


Proof of Lemma\^ Put 


[n'yntl 


^>^(4 — X/ (^{Cn,fc<V7(a„,fe-)} ^n{Vn{an,k )))■ 


nyn 


k=l 


Then, {(Z„(t), J4(A)) : t > 0} is a martingale with quadratic variation 

[n-ynt] 


[Zn]{t) = ^ “ 0n(14(an,fc-))) 


k=l 
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(see Lemma 4.2 in Dai and He (2010|). Clearly, lim„^oo[-^n](i) = 0 for t > 0. We also have 


lim sup \AZn{t)\ = 0 
n^oo o<t<T 


because |AZ„(f)| < l/(n7„). Then, it follows from the martingale FCLT (see Theorem 7.1.4 in 


Ethier and Kurtz (1986)) that => 0 as n —>■ oo. By (5.9) and the random time-change theorem, 

(A.8) 


Zn o En ^ 0 as n 


oo. 


Put 


(i) 

^n(t} = ^ 0n(Vn(an,k-)) and f„(f) 

k=l 


— rn(7ni)- 
n-in 


By (3.3) and (5.8), r„(t) can be written into a Riemann-Stieltjes integral 


fn(t) 



H{Vn{u-))dEn{u). 


Using integration by parts, we obtain 


Tnit)-pfJ, H{Vniu))du = (Enit) - pfJ,t)H{Vn{t)) - / {En{u) - ppu) dH(Vniu)). 

Jo Jo 


By (5.9), the first term on the right side satisfies 


sup \{En{t) - ppt)H{Vn{t))\ => 0 as n —)• oo. 

0<t<T 


For the second term, it follows from (3.2) that 


sup 

0<t<T 


(Eniu) - ppu) dH{Vn{u)) 


< sup \En{t)-ppt\ [ \dH{Vn{t))\ 

0 <t<T Jo 


< K sup \En{t) — ppt\ 
0<t<T 



\dVnit)\. 


By (5.9) and (5.11), 


sup 

0<t<T 


{En{u) - ppu) dH{Vn{u)) 


as n —oo. 


It follows that 


sup 

0<t<T 


rn{t)-pp H{Vn{u))du 
Jo 


0 as n —)• oo. 


Because Rn{t) = Zn{En{t)) + r„(f), the lemma follows from (A.8) and (A.9). 


(A.9) 

□ 
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Proof of Lemma^ Put 


^ ^ (^{Cn,fc<Pn(a„.fc-)} “ H{Vn{an,k-))) 


and 


\ri'ynt\ 

»W = 775^1 E 


fc=l 


Clearly, P[Cn,fc < w)] = H{w) = {p — l)/p for n, A; G N. Then, =► Z as re —)• oo by Donsker’s 
theorem, where Z is a driftless Brownian motion with variance {p — P)!(?. Because En, Bn, and 
Z'n are mutually independent, it follows from (3.6) and Theoremthat 


{En,Bn,Zlf} ^ {E,B,Z) asre-^oo, 
where E, B, and Z are also mutually independent. 


(A.IO) 


By Lemma 4.2 in Dai and He (2010), {(Z„(t) — Z^(t),T'„(t)) : t > 0} is a martingale with 

, [n-ynt] 


k=l 


Because (n,k is independent of En^k-i and Vn{an^k—) is Tn,A:-i-™easnrable, 

v2 


E 


(^{Cn,fc<V'„(a„,fc-)} l{Cn, H{Vn{an,k ))) 

= H{Vn{an,k-)) + H{w) - 2H{Vn{an,k-) Aw)- (H(re;) - H(K«fc-)))' 
< H{Vn{an,k-)) + H{w) - 2H{Vn{an,k-) A w) 

= \H{Vn{an,k-)) - 


Hence, 


1 


[n-ynt] 


E[[Zn - Z'n]{t)] < - ^ E[\H{Vnian,k-)) - H{w)\] <t-E 

k=l 


snp \H{Vn{an,k-)) - H{w)\ 

*- l<k<n'ynt 


For any e > 0 and t > 0, it follows from (3.2) that 


sup \H{Vnian,k-)) - H{w)\ > e 

'■ l<A:<n7„t 


< 


< 


< 


sup \Vn{an,k-) - W\ > - 
^ l<k<n‘ynt 


sup \Vn{an,k-) -w\>- +¥[Enip Jut) < njnt] 
l<k<En{fJ,-^'ynt) 

sup \Vn{u) - w\ > - +¥[En{p~^t) < t]. 
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Then, we deduce from (5.9) and Proposition that 


lim P 

n^oo 


sup \H{Vn{an,k-)) - H{w)\ > e 

l<k<n')nt 


= 0 . 


Because \HiVnian^k—)) — H{'w)\ < 1, we further obtain 


lim E 

n^oo 


sup \H{Vn{an,k-)) - H{w)\ 

l<k<n'ynt 


= 0 . 


Then, E[[Zn — Z!^]{t)~\ —)• 0 and thus [Zn — Z'j^]{t) => 0 as n —oo for all t > 0. In addition, because 
\A{Zn — Z'j^){t)\ < 2jy/rir^, it follows from the martingale FCLT that => 0 as n —>■ oo. 


which, along with (A. 10) and the convergence-together theorem, implies that 

Z^'j B, Z^ as n ^ oo. 


Because Rn = ZnoEn, we conclude the proof using (5.9) and the random time-change theorem. □ 
Proof of Lemma\^ Put 

Eni'ynt) 

[H{Vn{an,k-)) - H{w)) - (HiVniu)) - H{w)) du. 


By (5.8) and integration by parts, 

rt 


T'Jt) = [ {H{Vn{u-)) - H{tD)) dEniu) - V^/9 [ {H{Vn{u)) - H{w)) du 

Jo Jo 

= - H{w)){En{t) - Pilt) - f {En{t) - ppt) dH{V^{u)) 




Jo 


= -{H{Vn{t))-H{w))En{t)- - [ Eniu-)dH{Vn{u)) + - V AK(u)AF(K(u)). 


0<u<t 


Consider the three terms on the right side. By (3.6) and Proposition]^ 


sup \{H{Vn{t)) - H{w))En{t)\ 0 as n —)■ oo. 

0<t<T 


Using the continuous mapping theorem and Theorem 3.9 in Billingsley (1999), we obtain 


{En, H o Vn) ^ {E, H{w)x) asn->oo. 

By (3.2), Jq* \dH{Vn{u))\ < k Jq |di4('u)|. Then, it follows from (5.11) that 


n—^oo ^ Jo 


lim limsupP / \dH{Vn{t))\ > a 


= 0 , 


(A.ll) 


which implies that {H o 14i : n G N} is uniformly tight (see Definition 7.4 in Kurtz and Protter 
(1996) for the definition of uniform tightness). Because Jq H{w)E{u—) dx{u) = 0 for all t > 0, it 
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follows from Theorem 7.10 in Kurtz and Protter (|1996|) that 

0 as n —)• oo. 


sup 

0<t<T 


En{u-) dH{Vn{u)) 


In addition, the third term satisfies 


sup I ^ AEnit)AH(ynit)) 


0<t<T 


< sup \AEnit)\ / \dH{Vn{t))\. 

0 <t<T Jo 


By (3.6), (A.11), and the fact that E has almost sure continuous sample paths, 

^ AEn{t)AH{Vn{t)) 


sup 
0<t<T 


0 as re —>■ oo. 


Therefore, T' 0 as re 


oo. 


Rewrite (5.15) into 


Vn{t) = K(0) + Mn{t) - - .JrEfnP / {H{Vn{u)) - H{w)) du. 

Jo 


It follows from (3.2) and (5.13) that 


\Vn{t)\ < |K(0) + Unit) - %{t)\ + / \H{Vn{u)) - H{w)\du 

Jo 

< |I4(0) + Mn{t) - t;(t)| +KP [ |K(n)| dre. 

Jo 


Using Gronwall’s inequality (see Lemma 21.4 in Kallenberg (2002)), we obtain 

\Vn{t)\ < |K(0) + Mnit) - T'„(t)| exjp{Kpt). 


By (3.7), (5.14), (5.16), Lemma 4 and the convergence of T(j, we have 


lim limsupP 


a—>-oo n—>-cx ^0<t<T 


sup |K(0) + Mn{t) - T'„(t)| > a 


= 0 , 


which implies that {Vn ■ re G N} is stochastically bounded. 

Put 

{t — w)~^{H{t) — H{w)) for t / re), 
fniw) for t = w. 


pit) = 


By (5.8) and (5.13), 

rt 


^/nr^ {H{Vn{u))-H{w))du-fHiw) I4(re) dre = / ((/?(I4(re)) -///(re;))l4(re) dre. 
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Because lim^-,.^ (/?(re) = fniw) and {Vn : re G N} is stochastically bounded, it follows from Propo- 
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sition [2] that 

sup ^/n'yn / {H{Vn{u)) — H{w)) du — fH{w) / Vn{u)du => 0 as n —)> 00 , 
o<t<T Jo Jo 

which, along with the convergence of T^, leads to the assertion of the lemma. □ 


A.3 Proofs of lemmas for Theorem [2] 

Proof of Lemma^ Using integration by parts, we obtain 

/ t+lD 

En{u) dH{t + w - u) + Enit + w) - {I - H{w))Enit). 


It follows from (3.6) that 

rt-\-'W 


G'n{t) ^ J E{u) dH{t + w — u) + E{t + w) — {1 — H{w))E{t) as n —)> 00 . 


Using integration by parts again, we have 

pt-\-w 


rt-\-w 


/ E{u)dH{t + w-u)+E{t + w)-{1-H{w))E{t) = {1 - H{t + w - u)) dE{u). 

Jt Jt 

Then, ^ G'{t) as n —>• 00 . 

Write < and = Cn,fc+i?„( 7 „t), which are the arrival and patience times of 

the A:th customer arriving after respectively. Put 

^ \n')nu\ 


k=l 


with bn^k = k/^pfj.n'jn)- Then, Kn is a martingale with quadratic variation 


[n7„uj 

k=l 


Because 


E[{H{w - bn,k) - = H{w - &n,fc)(l - H{w - bn,k)), 


by the weak law of large numbers (see, e.g.. Theorem 5.14 of Klenke (2014)), we obtain 


ru/pn 

[Kn]{u) H{w — s)(l — H{w — s)) ds. 

Jo 


Since \AKn{u)\ < l/y^rryj^ for u>0, we further have 


lim sup |AA„(ri)| =0 for T > 0. 

n^QOQ<u<T 
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Then, it follows from the martingale FCLT that Kn ^ K as n —)■ oo, where 


Put 


fu/pii _ ^ 

K{u) = / pfiH{w — s)(l — H{w — s)) diS(s). 

Jo 


^ \n-i„u\ 


/n 


k=l 


By similar arguments as in the proof of Lemmawe can prove that — K'n 0 as n —oo. 

By (5.15), Lemmas and Proposition]^ and the asymptotic equivalence between Vn and 
Wn, we deduce that (.E„,1T„) ^ {E,W), so that {Wn,G'^) => {W,G') as n ^ oo. Because Kn is 
independent of (Wn(t), G^(t)), we further obtain 

iWn{t)x, &nit)x, Kn) => {W{t)x, G'{t)x, K) as n oo, 

where W{t), G{t), and K are mutually independent. Because iL„ — K'n 0 as n —)• oo and 

G':,{t)=k'n{En{t + Wn{t))-En{t)), 


the assertion of the lemma follows from (5.9), (5.12), and the random time-change theorem. □ 


Proof of Lemma\^ Since En ^ E and E has almost sure continuous sample paths, then by (5.12), 

En{t + Wn{t)) — En{t -|- u)) =i> 0 as n —)> oo. 

Because 


y/nXn 


rt+w^(t) 


' t-\-w 


H{t + Wn{t) -u)dEn{u) < H{\Wn{t) - w\)\Wn{t)[En{t + Wn{t)) - En{t + w))\, 


it follows from (5.9), (5.12), and Theoremthat 


pt-f-Wnit) _ _ 

y/nxf) / H{t + Wn{t) — u) dEn{u) 0 as n —>■ oo. 

J t-\-w 


Write 


^{u,6) = 


6 ^{H{u + 6) — H{u)) for (5 / 0, 


^fniu) for (5 = 0. 

Then, lim^^o p{u, S) = fniu) for u > 0. Using the fact that H{w) = {p — 1)/p, we obtain 

rw 

{p - l)pWn{t) - y/rvffpp / {H{Wn{t) - u) - H {w - u)) du 

Jo 

pw 

= ppWnit) / ifniw -u) - p{w - u, Wn{t) - w)) du. 

Jo 

By (3.2), we have \fH{w — u) — ip{w — u,Wn{t) — u))| < 2k. It follows from (5.12), Theorem]^ and 
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the dominated convergence theorem that 


{p — l)ljWn{t) — / {H{Wn{t) — u) — H{w — u)) du ^ Q as n —)■ oo. 


Hence, Yn{t) => 0 as n —)• oo. 

Put Hn{u) = H(t + w — u) — H{t + Wn{t) — u) for n > 0. By (5.12) and Theorem 3.9 in 
Billingsle'^ (1999), we obtain 

{En, Hn) => {E, 0) as n —>• oo. 


By similar arguments as in the proof of Lemma we can show that {Hn : n G N} is uniformly 
tight. Using integration by parts, we obtain 




En{u) dHn{u) + Hn{t + w)En{t + w) - Hn{t)En{t). 


Then, it follows from Theorem 7.10 in Kurtz and Protter (1996) that 

pt-\-w 

/ En{u) dHn{u) =► 0 as n ^ oo, 


which, along with (3.6), implies that Y^{t) 0 as n —)• oo. The joint convergence follows from 

Theorem 3.9 in Billingsley (1999). □ 
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